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1, INTRODUCTION 

The population rate of humans around the world are growing rapidly every year. These phenomena 
will penetrate the increasing in the development and construction of building to fulfill human desire [1]. 
This rise also affects the electrical energy consumption among humans. Thus, concerns of energy 
consumption have been put into a high consideration recently in Malaysia. The energy consumption 1s 
necessary to be predicted to achieve energy conservative and reduce the cost [2-3]. On the other hands, 
a correct prediction of energy can be used for the future plans and provide security for the power system. 
The purpose of doing this study on consumption of energy is to minimize used of energy during the critical 
time [4-5]. Other than that, the prediction energy in buildings is very important to obtain Economical and 
environmental benefits It 1s also important to reduce Greenhouse gas emissions (GHG) such as carbon 
dioxide emissions [6-8] that can be considered as the main causes of global warming [9]. Several methods for 
predictions have been used to solve the load predicting problem. These methods can be categorized into time 
series and factor analysis [10]. This time series is used based on the historical load data. The factor analysis 1s 
based on several factors that can affect the energy consumption. Some of the factors are weather, numbers of 
human in the building and human behavior. However, the factor analysis is not easy to be implemented due 
to the factors related are hard to be cater to the energy consumption. 

Studies on the electrical energy consumption prediction has been done by numerous methods that 
can be categorized into artificial intelligence (AI) methods as well as others conventional methods. 
Conventional methods prone to be higher in error and less accuracy compared to the AI methods. 
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Conventional methods consist of three stages which includes input layer, hidden layer and output layer. 
This conventional method only allows one-way direction for signals to travel from input to hidden layer and 
then to the output. There is no feedback or loops [10]. It 1s a straightforward method that only involves input 
and output through hidden layer. 

The AI methods are currently more advance than the conventional methods due to the ability to 
solve lots of problems that happen nowadays. There are numerous prediction techniques such as Artificial 
Neural Network (ANN), Bayesian Belief Network (BBN), Fuzzy Logic and Recurrent Neural Networks 
(RNNs) [11]. More recently, the ANN has employed a deeper architecture called Deep Layered Neural 
Network (DLNN) which has been used widely in many applications [12-15]. 

Several reviews on the methods for forecasting energy using Artificial Intelligence have been made. 
Recently, several researchers study about the short-term forecasting of electricity consumptions in Palestine 
using Artificial Neural Networks (ANN) [16]. The results when using this method shows that it is 
categorized as a very important tool to perform a good prediction with least mean square error as short load 
forecasting model. Besides, a research on prediction on rural energy consumption in Hebei Province is 
carried out using Gray Theory Model [17], while Suyono et al. predict the availability of solar radiation using 
the Extreme Learning Machine (ELM) [18]. Another research used genetic algorithm (GA) and least square 
support vector regression (LSSVR) for forecasting clean energy consumption [19]. Based on the research, 
GA-LSSVR is an effective method to reduce the degree of over-fit and also can be used as other alternatives 
to other clean energy consumption forecast. Particle swarm optimization is another method that is used to 
forecast energy demand of Turkey [20]. The result of this research shows that particle swarm optimization 
technique gives better forecasts than ant colony optimization technique. 


2. RESEARCH METHOD 
2.1. Data Collection and Selection 

Data of energy consumption from tower 2 engineering building in U1TM Shah Alam were collected 
by using several smart meters. Data were collected on every | minute for about 2 months. Smart meter can be 
used as two ways communication between the meter and the central that is to collect and record the data in 
real time. Data selected must be choose based on the constraint that needs to be used for this prediction. 
For energy consumption prediction, 2 data have been selected as the input of the RNNs which is the voltage 
and current. The power data has been selected as the output of the RNNs. The input and output involve are as 
shown in the formula: 


P=IxV 


where: 
P = power in watt (W) 
I = current in ampere (A) 
V = voltage in volt (V) 
The input and output data must be related to get the desire results of prediction. The data used for 
training 1s Tuesday data consist of 1440 observations of data and for testing also consist 1440 observations. 


2.2. Actual and Normalized Data 

This work uses two types of data that are actual data of energy consumption and normalized data. 
The data that have been normalized are ranged between 0 to 1. The formula below was used to perform the 
normalization. 


data—min(data) 


(1) 


norm_data = —————————— 
max (data)—min(data) 


2.3. Recurrent Neural Networks (RNN) and Nonlinear Autoregressive Neural Network with External 
Input NARX 

Recently, there has been considerable interest in the application of recurrent neural networks 
(RNNs) and NARX as methods for prediction. RNN and NARX has the ability to learn patterns from the past 
records and also to generalize and project the future load patterns for hidden data. Both RNN and NARX are 
networks with feedback connection. It has been proved that the proposed RNN gives a relatively accurate 
results for prediction [1, 21-22]. This technique is applicable to variety of problem such as speech 
recognition, language modeling as well as translation. In electrical field, RNN and NARX is commonly used 
for prediction of energy consumption, energy efficiencies and load demand. Recent study the RNN and 
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NARX method is developed in order to forecast the energy consumption in [23-25]. This model can predict 
energy consumption and will help in developing highly applicable energy policies in Iran. Another research 
in 2014 by Chen Fang using Elman Recurrent Neural Network (ERNN) applied on specific humidity 
forecasting. This model of ERNN builds to predict the specific humidity from three weather stations. 
The results on this research shows that this ERNN model has great potential than Multilayer perceptron 
(MLP) during winter season when the data noise produced is very small [26]. This result shows that ERNN 
can be an alternative method to predict specific humidity. 

Based on Figure 1, this work uses 2 inputs, | output and 10 and 20 hidden layers for RNN and 
NARX respectively. NARX network has and external input with delay of 20. Both the RNN and NARX are 
trained using the Levenberg-Marquardt (trainlm) for as training function. The construction of network 
models in MATLAB is shown below. 


Layer Recurrent Neural Network (view) 


Input(t) 








Figure 1. Construction of RNN (top) and NARX (bottom) models used in this work 


RNN: layrecnet(layerDelays,hiddenSizes,trainFcn) 
NARX: narxnet(inputDelays,feedbackDelays,hiddenSizes,trainFcn) 


where: 
layerDelays = row vector of increasing 0 or positive delays (default 1:2) 
hiddenSizes = row vector of one or more hidden layer sizes (default =10) 
trainFcn = training function (default = ‘trainlm’) 
The procedure of training and testing are explained in steps below: 
a. Load the energy consumption data collected on Tuesday 25" July 2017 that consist the value voltage 
and current as input, while the value of power as output 
b. |The data were trained using layer recurrent neural network (layrecnet) and NARX (narxnet). The delay 
used 1s 3 for RNN and 20 for NARX. 
c. After training, the predicted data was compared with the actual data. 
d. The training result was analyzed using histogram for the error and means square error (MSE) value was 
calculated. 
e. Load the energy consumption data collected on Thursday 27" July 2017 for testing. 
f. The test result has also been analyzed using histogram for the error and means square error (MSE) value 
was calculated. 


2.4. Process Flow 

The flow chart shown in Figure 2 1s the overall process of RNN and NARX method for prediction of 
energy consumption in Tower 2 Engineering Building in U1TM Shah Alam. The overall process consists of 
the data collection and selection based on the constraint, training process, testing process as well as 
comparison and bench-marking of the result. 
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Figure 2. Flow chart for RNN and NARX implementation 


3. RESULTS AND ANALYSIS 

In this work, a trial and error techniques has been used to get the suitable number of hidden layer for 
both RNN and NARX networks. For RNN implementation, we simply start with five hidden layers and 
increases by five layers up until twenty layers. For NARX, we start with five hidden layers, and increase the 
layers gradually by 5 layers up to 25 layers. Then, the training performance for RNN and NARX 
implementation are collected based on the Means Square Error (MSE) values. Based on the results tabulated 
in Table | and Table 2, MSE for training generally decreases when the number of hidden layer increases. 
Thus, it can be concluded that for training, the number of hidden layers is inversely proportional to the MSE. 
The lowest MSE recorded for RNN and NARX is at 10 hidden layers and 20 hidden layers respectively, 
which yield MSE of 0.5689 and 37.3488 respectively. For RNN the highest MSE recorded is at 5 hidden 
layers which is 1.4065, indicating that too small hidden layers are unable to learn the underlying patterns. 
However, selection of the number of hidden layers should be determined carefully, since when the number of 
hidden layers are more than necessary, the network will overfit and fail to learn the inherent pattern. Instead 
it will generalize too well on the training data and fail on testing data. Based on the Table 1 and Table 2, 
the optimal number of hidden layers used in this work for RNN and NARX 1s 10 and 20 respectively. 


3.1. Test Results Using Actual data 

The implementation of RNN and NARX models for prediction contains two test parts, where we 
report the results using an actual data and normalized data. For the first part, the result for test data was 
obtained through the graph in Figure 3, Figure 4 and Figure 5. The actual output is plotted versus the 
predicted output on the same graph. We can observe that RNN model gave better predicted values when 
compared to NARX. RNN test performance reported as MSE value is 55.834, where NARX performance is 
58.145. Based on Figure 4, comparison between the actual and predicted values shows some errors are 
apparent. For an ideal model, the predicted plot would overlay the actual plot completely. The errors for the 


Indonesian J Elec Eng & Comp Sci, Vol. 17, No. 3, March 2020: 1215 - 1223 


Indonesian J Elec Eng & Comp Sci ISSN: 2502-4752 O 1219 


test result between actual output and predicted output were shown for both RNN and NARX in histogram 
graph in Figure 5. The histogram graph can be used to record the frequent numbers of errors in the testing 
result. Based on Figure 4 the error for RNN test result shows that it lies between -6 to 4 and the most 
distribution is at 0. While for NARX, the error has a wider spread between -30 to 30, indicative that RNN 
performs significantly better than NARX for actual data usage. 


Table 1. Selection of Hidden Layer based on MSE Performance for RNN 


Number of hidden layers MSE 
RNN RNN (Normalized) 
5 1.4065 0.4851 
10 0.5689 0.5340 
15 0.7990 0.7219 
20 0.8696 0.8836 


Table 2. Selection of Hidden Layer based on MSE Performance for NARX 


Number of hidden layers MSE 
NARX NARX (Normalized) 
5 41.3837 37.4890 
10 38.1679 35.6996 
15 43.7693 39.4053 
20 37.3488 38.3441 
Z5 40.3913 31.5778 
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Figure 4. Magnified view of test result showing actual and predicted data of power (kWh) 
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Figure 5. The errors between actual and predict data illustrated as error histogram 


3.2. Test result using normalized data. 

For the second part, the result of test data was obtained from normalized data. The results are shown 
in Figure 6, Figure 7 and Figure 8. Likewise, the actual output is plotted versus the predicted output. 
According to Figure 7, RNN model yields better prediction than NARX model. The outlying normalized and 
predicted data is better in RNN case, while for NARX, the prediction contains large errors. In fact, 
the performance of this RNN in term of MSE is 42.215, while for NARX the performance is 60.267. 
For RNN model, the shape of predicted graph lies below the actual graph. The errors for the testing result 
between actual output and predicted output were shown in histogram graph in Figure 8. Based on Figure 7 
the error for RNN in this test result shows that it lies between -10 to 10 and the most distribution is at 0. 
Larger error observed in NARX model where the error is spread between -20 to 40. 
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Figure 6. Test result showing the actual and predicted data of power (kWh) 


Further analysis on the results compares the MSE performance for both RNN and NARX model in 
actual and normalized data usage cases. This is tabulated in Table 3. Based on Table 3, the best overall 
performance is achieved by RNN model which uses normalized data, where it delivers lowest error of 
42.215. The worst model 1s NARX model using normalized data, which yields MSE of 60.267. Figure 9 
shows larger magnification of RNN model prediction, to highlight its accuracy. 


Table 3. MSE performance for RNN and NARX on test data 


Data RNN NARX 
Actual 55.834 58.145 
Normalized 42.215 60.267 
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Figure 7. Magnified view of test result showing actual and predicted data of power (kWh) 
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Figure 8. The errors between actual and predict data illustrated as error histogram 
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Figure 9. Larger magnification of RNN model result, showing relatively accurate prediction 


4. CONCLUSION 


In In this paper, two models namely RNN model and NARX model was trained for prediction of 
energy consumption in Tower 2 Engineering Building at Ui1TM Shah Alam. Two historical data sets have 
been used and analysed on this model for data of energy consumption which are data collected on Tuesday 
25th July 2017 and on Thursday 27th July 2017. The prediction process involves three stages which is the 
selection of data based on constraint, training, as well as testing. Then, to select the best hidden layer, 
trial and error technique is used. The best number of hidden layers is 10 for RNN and 20 for NARX, which is 
according to experiments conducted. The results obtained after training and testing for both actual and 
normalized data, shows that RNN model with normalized data delivers the best performance with MSE of 
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42.215. Further enhancement to the work here can be achieved by using larger data such as data of a month 
or of a year energy consumption of this building. Other than that, continuation of this study on a new method 
for prediction that may give more accurate results for energy predictions. 
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